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Abstract 



We study a multi-marginal optimal transportation problem on a Rie- 
mannian manifold, with cost function given by the average distance squared 
from multiple points to their barycenter. Under a standard regularity con- 
dition on the first marginal, we prove that the optimal measure is unique 
and concentrated on the graph of a function over the first variable, thus 
inducing a Monge solution. This result generalizes McCann's polar fac- 
torization theorem on manifolds from 2 to several marginals, in the same 
sense that a well known result of Gangbo and Swiech generalizes Brenier's 
polar factorization theorem on R™. 

1 Introduction 

In this paper, we study an optimal transport problem with several marginals. 
Given Borel probability measures fi±, fj,2, /i m , each compactly supported on 
some smooth manifold M, and a cost function c : M m — >• WL, the multi- marginal 
optimal transport problem of Monge is to minimize 



J M 

among (m — l)-tuples of mappings {F2, ■■■,F m ), such that for each i, the map 
Ft : M i— > M pushes the measure jj>i forward to /i,-; that is, given any Borel 
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The Kantorovich formulation of the multi-marginal optimal transport prob- 
lem is the minimize 



among all probability measures 7 on the product space M m which project to the 
Hi\ that is, such that 7(M i " 1 x Ax M m ~ i ) = m(A) for all Borcl A C M and all 
i = 1,2, m. Under reasonable conditions (e.g. continuity of the cost function 
and compactness of the supports of the measures), it is fairly straightforward 
to assert the existence of a minimizer 7 in (jK|) . Note that, for any (F2, ...,F m ) 
satisfying the pushforward constraint in (|M[) . the pushforward measure 7 of [i\ 
by the map (Id, F2, Fm) ■ M — > M m satisfies the marginal constraint in fK]) 
and so ([K]) is a relaxed version of fMj . 

When m = 2, (jK|) and (|M[) correspond to the Monge and Kantorovich 
formulations, respectively, of the classical optimal transportation problem. This 
problem has been studied extensively over the past 25 years, and has many 
applications and deep connections to various areas of mathematics. Under a 
structural condition on the cost function, that is in particular satisfied by the 
Riemannian distance squared, and a fairly weak regularity condition on the first 
marginal, it is now well known that the optimal measure is concentrated on the 
graph {(xi, F2(xi)) : X\ <E M} of a map F2 : M — > M. The map F2 minimizes 
(jM]) . and both F 2 and 7 are unique [T7] [H] HH [IB]- 

In recent years, the multi-marginal case, m > 3 has attracted increasing 
attention, due to emerging applications in areas such as economics [5] [7] , physics 
M 0i cyclical monotonicity [T0j[T4l[T5] , and systems of elliptic equations [16]. 
In contrast to the two marginal case, the structure of solutions for general 
cost functions is not well understood. For costs with certain special properties, 
however, a number of authors have proven that, like in the two marginal case, 
the optimal measure 7 is concentrated on the graph of a function over x\ and is 
unique [13] [I] dJ [ID] [S] • On the other hand, it is known that for certain other 
cost functions, the optimal measure 7 in the m > 3 case may be concentrated on 
a set of Hausdorff dimension larger than n :=dim(M) and be non unique [B] [2"Tj . 
It remains an open question to determine precisely which costs permit Monge 
solution and uniqueness results and which do not. 

The most important cost in the two marginal case is the quadratic cost, either 
c(x\,X2) = \x\ — X2I 2 on M C R", or c(x\,X2) = d(xi,X2) 2 , where d denotes 
the distance on a Riemannian manifold M. Uniqueness and Monge solution 
results for these two cost functions constitute famous theorems of Brcnier [2] 
and McCann [18], respectively; these two results are at the heart of optimal 
transport theory and underlie many of it applications. 

The best known result on multi-marginal problems concerns the cost function 
\ x i ~ x j\ 2 011 Euclidean space, M C R n . For this cost, Gangbo and 
Swiech |13| proved uniqueness and existence of Monge solution, amounting to a 
generalization of Brenier's theorem from 2 to m > 3 marginals. 

The purpose of this paper is to generalize McCann's result on manifolds to 
the multi-marginal case, in the same spirit that Gangbo and Swiech's theorem 
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generalizes the result of Brenier. Alternatively, we can think of our main result 
as generalizing Gangbo and Swiech's result to manifolds, in the same sense that 
McCann's theorem generalizes Brcnier's. We will prove uniqueness and Mongc 
solution results for the multi-marginal problem on a compact Riemannian man- 
ifold, with cost function 

m d 2 

c(x 1 ,x 2 ,...,x m ) = inf Y] —(xi,y). (1) 

i— 1 

Let us note that, while the cost S»=4j d(xi, x j) 2 is a direct, algebraic gener- 
alization of the Gangbo-Swiech cost, the cost function ([T]) seems to be more 
intimately linked with the underlying geometry of the manifold M. This cost 
function measures the average distance squared between the points x\, X2, 
x m and their baryccnter (also known as the Frechet or Karcher mean). Al- 
ternatively, we can view (xi,-- - ,x m ) as a point in the Riemannian product 

M X • • • X M equipped with the product distance v/j^Li Tr( x ii Hi) an< i the 
cost function in ([1} measures the distance from (xi, • ■ • , x n ) to the diagonal set 
{(y, • • ■ ,y)}j, g A/. Straightforward calculations in [19j show that the cost func- 
tions of Brenier (when m = 2 on K"), McCann (when m = 2 on a Riemannian 
manifold) and Gangbo-Swiech (for general m on M. n ) are all equal to cost (TTJ 
in the appropriate settings, and so (JTJ) for m > 2 on a Riemannian manifold 
is indeed an extension of these costs. Furthermore, there is an intimate link 
between optimal transport with cost {1} and the notion of the barycenter of 
the measures fj,i, ^2, A*2 as considered by Agueh and Carlier pQ in the Eu- 
clidean case, following the work of Carlier and Ekeland [5] who considered more 
a general version of the cost function ([I]) in economic applications (see cost ((3]) 
below). 

Our main theorem is the following. 

Theorem 1.1. Assume [i\ is absolutely continuous with respect to local coordi- 
nates. Then the solution 7 to (|K|) with cost function ([T]) is concentrated on the 
graph of a mapping (_F2,f3, ■■■,F m ) over the first variable. This mapping is a 
solution to Monge's problem (|Mj) . and the solutions to both (|K|) and (|M|) are 
unigue. 

This theorem is, to the best of our knowledge, the first of its kind for multi- 
marginal problems on manifolds with topology different from R" . We also note 
that one can extend Theorem ll.ll to slightly more general costs, similar to ([1]): 
see Theorem 16. II 

The Gangbo-Swiech optimal maps are all compositions of Brenier maps, 
Vu* o Vui, where the u* and u\ are convex functions; this structure was further 
clarified by Agueh and Carlier [T], who showed that the first map, Viti, is the 
optimal map pushing /ii forward to the Wasscrstein barycenter (see (fTTjO of the 
measures fj,i, ...[i m , while the second map Vu* is the optimal mapping pushing 
the barycenter forward to the measure [ii pQ. Our optimizers here take the 
same form, but using McCann maps rather than Brenier maps to account for 



3 



the curved geometry: the optimizers are of the form 

Fi(xi) = exp exPaii(v „ l(xi)) (V<(exp Xi (V'Ui(a;i)))) (2) 

for ^--concave u\, u\\ that is, they are compositions of McCann maps (sec 
Proposition ^. ip . Therefore, our optimal map is built out of McCann maps, in 
the same sense that the Gangbo-Swiech maps are built from Brenier maps. 

Let us mention that multi-marginal problems with cost functions of the 
general form 

m 

c(xi,x 2 , ...,x m ) = inf ) Ci(xi,y) (3) 

v r-f 

i—i 

arise naturally in matching problems in economics as considered by Carlier 
and Ekeland [5] and Chiapporri, McCann and Nesheim. [7]. In this setting, 
solving the multi-marginal problem can be interpreted as finding an equilibrium 
assignment of m different groups of workers to contracts y. 

Motivated in part by this application, problems of costs of the form were 
studied by one of us in a recent paper |19j . and conditions on the c,'s were 
found that ensured existence and uniqueness of Mongc solutions [19] . However, 
that paper was restricted to Euclidean space, M C M. n , and required twist 
and a type of non-degeneracy conditions on the cost functions Cf, topological 
obstructions prevent these hypotheses from holding on, for example, a compact 
manifold. It should be noted, however, that the results in [T!5] do apply to 
the quadratic cost on bounded domains in a Hadamard manifold (a simply 
connected Riemannian manifold with nonpositive sectional curvature); in this 
case, the Cartan-Hadamard Theorem ensures that M has the topology of M. n . 

Our method for the proof of Thcorcm ll.ll is a bit different from the approach 
in [19] , which has its origin in the papers of Carlier and Ekeland and Aguch 
and Carlier pQ. In that work, it was crucial to have the absolute continuity 
of the (generalized) barycenter measure with respect to local coordinates, since 
the optimal maps F^s were constructed from Brenier theory applied to optimal 
maps from the barycenter measure, say v (see (jlip ). to the target measure /Ltj, 
whose existence requires absolute continuity of the source distribution v. In the 
Euclidean setting, the absolute continuity of v has been shown by Agueh and 
Carlier pQ, but its Riemannian extension is still not known except the case of 
nonpositively curved manifold |19j . In our method, we directly (without relying 
on absolute continuity of v) show that the solution 7 to (|K|) is concentrated on a 
graph of map. First, we will show that for fi\ almost every x\, there is a unique 
point y attaining the minimum in (TTJ), for x%, ■■■^Xm such that (xi,X2, ...,x m ) is 
in the support of the optimal measure 7 in l|K|). We will then show, in turn, that 
for each y, there is at most 1 corresponding point (xi, x m ) in the support 
of 7; together, these two facts will imply the main result. It is also remarkable 
that the structure of the maps Fi as in ([2]) has been obtained without relying on 
absolute continuity of v, while the maps exp y (Vui(y) in ([2]) are Brenier maps 
from v to fii. 
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In Section 2, we recall the dual problem to (|Kj) and some of its properties. 
In the third section, we will prove several lemmas which are needed for the proof 
of Theorem 1 while the Theorem itself will be proved in Section 4. In Section 
5, we discuss the connection of (|K|) and (|M[) with the Wasserstcin barycenter of 
the measures /ii, ...,/i m , showing the structure ^ of the maps Fi. In the final 
section, we extend our results to other functions of the distance on a Riemannian 
manifold. 

2 Duality 

Here we recall known results on the Kantorovich dual problem, and make some 
remarks on the properties of its solution which will be relevant later on. 
The dual problem to (|K|) is to minimize, 

m „ 

y~] / Ui(xi)dni(xi) (D) 

• J M 

among all m-tuples of functions («i,«2, ■■•,u m ) of functions, with Ui € L 1 ^) 
and 

m 

^Uj(xi) < c(xi,x 2 ,-.,x m ) (4) 

i=l 

for <8>™!^i almost everywhere point (xi, ...x m ). We will say that an m-tuple 
(ill, U2, u m ) is c-conjugate if, for all i = 1,2, m, we have 

Ui{xi) = inf [c(ari,a;2) —,x m ) - y^Uj(xj)] . (5) 

It is clear that any c-conjugate (ui,i«2, ■■■u m ) satisfies (|4]), so is a viable com- 
petitor in (|Pl) . 

The following result is well known: 

Theorem 2.1. There exists a c-conjugate solution [u\, 112, u m ) to |D|) . 1/7 

is any optimal measure in the Kantorovich problem, we have 

m 

^Ui(xi) = c(xi,X2, ,..,!») 
i=l 

"/-almost everywhere. 

It is also well known that if the cost c is Lipschitz and (u±, 1x2, u m ) is 
a c-conjugate m-tuple such that if m is not identically infinity, then u\ must 
be Lipschitz [18]. Our cost function ([1]) is Lipschitz, since it is defined as the 
infimum of a family of Lipschitz functions (x\, • • ■ , x m ) 1— > X)"=i ^( x i: v)- 
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3 Properties of the cost function and consequences 
of optimality of 7 

In this section, we establish several key lemmas for the proof of Theorem 11.11 
Throughout this section, c(x\, ...,x m ) is the cost function as in ([T]), 7 denotes 
an optimal measure of the Kantorovich problem (fKj) . and (ui, ...,u m ) is a 
conjugate solution to (|D]) . 



Lemma 3.1. Fix (x±, x m ). Then any y which minimizes y 1— > d 2 (xi,y) 
is not in the cut locus of Xi for any i. 

Proof. Choose a point y in the cut locus of Xi for some i; we will show that 
y cannot minimize y 1— > ^2f—id (xi,y). By Lemma 3.12 in [5], we can find a 
constant K such that, for all u € T y M, and j = 1, 2, ...to, we have 

d 2 (x j; exp m) + d 2 (x j7 exp (-u)) - 2d 2 (xj,y) 

1 12 ^ ( 6 ) 



On the other hand, by Proposition 2.5 in the same paper, we can find some non 
\d such that 

d 2 (ir i ,exp, y u) + d 2 (x t ,exp y (-u)) - 2d 2 (x i ,y) 



zero u € T y M such that 



< -mK. (7) 



Therefore, we have 

m m 

^2d 2 {xj,y) = ^2d 2 (xj,y) +d 2 (x t ,y) 
-{m-l)K\u\ 2 1 



> 



{d 2 (x 3l exp y u) + d 2 (x ] ,exp y (~u))) 



2 2 

mK\u\ 2 1 9 2 

2 ^2 ( a; i) ex Ps/ u ) + « l^exp^-ti)) 



> ^5Z d2 ^' exp « w ) + ^^d 2 {xj,e^ y {-u)). 
Therefore, either 



2^ v *v ' 2 

i=i j=i 



^V^exp^u) < ^2d 2 (x j ,y), 

or 

m m 

d 2 (xj , exp y -it) < ^ d 2 (xj , y) ; 
3=1 3=1 

in either case, y cannot minimize X^^li d 2 (xj, y). □ 
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Lemma 3.2. The cost function c is everywhere superdifferentiable with respect 
to x\. That is, for all (xi,X2, ■ ■■,x m ) € M m there exist p £ T Xl M (the super- 
gradient) such that, for small v £ T Xl M , we have 

c(exp a , i v,x 2 , —,x m ) < c(xx,x 2 , —,x m ) + g{p,v) + o(\v\), 

where g denotes the metric. 

Proof. Choose y minimizing y \-> J2i^t { x iiV)- By [IB], Proposition 6, the 
function x\ i— » d 2 (xi,y) is superdifferentiable. Letting p be it's supergradient, 
we have, for small v 

d 2 " 2 d 2 

— (exp Xl v,y) < —(x 1 ,y)+g(p,v) + o(\v\). 
Now, by definition, we have 



d 2 d 2 
c(exp Xi v,x 2 ,...,x m ) < — (exp xi v,y) + ^2—( Xi ,y) 

i=2 

d 2 m d 2 " 2 

< -j(xi,y) + g{p,v) + o(H) +^2~ {xi,y) 

2=2 

= c(xi,X2, ~.,x m ) + g{p,v) + o(\v\). 

Therefore, c is superdifferentiable with respect to x\, with supergradient p. □ 

Lemma 3.3. At any point (x\, ...,x m ) where c is differentiable with respect to 
X\, there is a unique minimizing y in (|Tj). and moreover, 

y = cxp Xi (S/ Xl c(x 1 ,...,x m )). (8) 

Proof. For any minimizing y in ([I}, d 2 (x\,y) is differentiable as y ^ cut{x\) by 
Lemma HOI We then have 

V Xl c(xi, ...,x m ) = \7 Xl (^d 2 (xi,y)). 

This equation implies that y must equal exp a , i (V a;i c(a;i, ...,x m )); uniqueness 
follows immediately. 

□ 

Remark 1. This lemma may be of independent interest. The minimizing y in 
([Tj) is known as the Frechet or Marcher mean of the points xi,x 2 , ...,x m , and 
can be interpreted as a generalization of the notion of average to a curved space. 
Uniqueness of Frechet means is linked the curvature of the manifold M . For 
example, Frechet means are easily seen to be unique on a Hadamard manifold, 
but may be non-unique for certain Xi on spaces with positive sectoinal curvature; 
for example, every point on the equator is a Frechet mean of the north and south 
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poles on the sphere. The preceding lemma yields a sufficient condition for the 
Frechet mean of the points X\, X2, x m to be unique; namely, that the function 
c be differ entiable with respect to one of the variables. We will see later that 
this in fact implies that, if [i\ does not charge small sets, the Frechet mean of 
the points (x\,X2, ...,x m ) is unique 7 almost everywhere, where 7 is the optimal 
measure in (|K1). 



Lemma 3.4. Let (xx, X2, x m ) £ spt{^), and suppose the potential u\ is 
differ entiable at x\. Then the cost c is differ entiable with respect to x\ at 
(xi, X2, x m ) and moreover, 

W Xl c(xi, ...,x m ) = Vui(xi) (9) 

Proof. The proof is based on a now classical argument of McCann [T8]. By 
the properties ((4]) and J5]), Thcorcm[2l and the fact that (x±, x m ) £ spt(j), 
the gradient q = Vmi(xi) serves as a subgradient for c with respect to x\. By 
lemma I3T41 c has a supergradient, p, as well. Therefore, we conclude that c must 
be diffcrentiable with respect to x\, with gradient p = q = Vui(xi). □ 

Whenever the optimal y in ([1]) is unique, we will denote it by y(x\, ...,x m ). 
The following proposition implies injectivity of this correspondence from the 
support of 7. 

Lemma 3.5. Suppose x = (x\,--- ,x m ) and x = (xi,--- ,x m ) are both in 
spt(j) and there exists y £ M which minimizes both z i-> X)I=i d>{pu z ) an d 
z i-> d 2 {xi, z). Then x = x. 

Proof. Fix some i. We will show xi = Xi. 

Define x' = (xi,-- - ,Xi,--- ,Xk) and x' = (x\,--- ,Xi,--- ,Xk)- Recalling 
that c(x) = inf zS M X) ^{ x u z )i we have 



c(x') + c(x')< [^ferij + yfejll^tij.rijlj^j) (10) 
= c(x) + c(x), 

where the last equality follows from rearranging the terms and using the as- 
sumption 

m d 2 m d 2 

V e argmin zeM ^ —(x u z) Qargmin zeM ^—(x^z). 

1=1 4=1 

Also, the optimality of 7 implies a certain c-monotonicity property |21j ; as 
x, x £ sptj, this yields, 

c(x') + c(x') > c(x) + c(x). 

Moreover, from (JTU]), this implies that c(x') = [J2j=ii ^( x jjV)] + T^'f)' 
that is, 



y e argmin zeM [(^ — ( Xj ,z)) + — {x u z) 

The Riemannian distance is smooth away from the cut locus, so, by Lemma 
13.11 each z \-> ^-(xi,z) and each z h- »• ^-(xi,z) is differentiable at z = y. 
Therefore, as y G axgminJX^ *r( x ji z )] and V e argmin^^, j(i 3 ,z)) + 
— {Sii, z)] , 

$^V y ^-(xj,y) = = ]T) V tf — (&j,2/) + V y y(^,y). 
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This implies 



2 y 2 

It is well known that for x cut(y), x = exp y (^V y d 2 (x, y)) and so the above 
implies 

Xi = cxp y (iv y d 2 (x 4 ,y)) = cxp y (iv y d 2 (i 4 , y)) = x t . 
as desired. □ 



4 Main result 

We are now ready to prove Theorcm ll.il 

Proof. We first consider the assertion that optimal measures must be concen- 
trated on a graph. 

For any optimal measure 7, we need to show that, for \i\ almost every x\, 
there is a unique (2:2, £3, x m ) such that (x\, X2, X3, x m )) G spt(jy). As the 
optimal measure 7 projects to fii, for fj,i a.e. x\, we must have at least one 
(#2,2:3, ...,x m ) such that (xi, X2, X3, ...,x m ) G spt("f). Therefore, it remain only 
to prove uniqueness of the (x2,xa, ...,x m ). 

As mentioned in Section 2, it is well known that the Kantorovich potential 
u\ is Lipschitz, and hence differentiable fi\ almost everywhere |18| . Now, at 
every point where the potential is differentiable, the cost c is differentiable with 
respect to Xi, by Lemma [3.41 Thus from ((HJ and (j9j), we see that there is a 
unique y minimizing (Q]), for all (xi,X2,X3, ...,x m ) G spt(j); namely 

V = cxp Xl (S7u( Xl )). 

It follows from Lemma 13.51 that for this y, there is at most one 

(x 1 ,x 2 , -;X m ) G spt{-y) 



with y € argmin^ d 2 (xi, y), showing the uniqueness of (x2, ...,x m ). Denote 
this point 

G{y) := (G 1 (y),G 2 (y),....G m (y)). 

Then, denoting Fi(xi) = Gi(exp Xi (Vu(ii))), Fi is a well defined //i-almost 
everywhere map with the desired properties. 

It remains to prove uniqueness of the optimal measure 7 and the optimal 
maps Fi ; our argument for this is standard in optimal transport theory. Suppose 
there are two distinct optimizers, 7 and 7; by the argument above, they are 
concentrated on graphs, (-^2,^3, F m ) and (F 2 ,F 3 , F m ), respectively. Now, 
by linearity of the Kantorovich functional, the interpolant ^7 + ^7 is also so 
optimal and must also be concentrated on a graph. This immediately implies 
that (F 2 , F3, F m ) = (F 2 , F3, F m ) almost everywhere, completing the proof. 

□ 



5 Barycenters in Wasserstein space on Rieman- 
nian manifolds 

In this section, we discuss the connection of our result with the Wasserstein 
barycenter of fj,\,...fx 2 . The barycenter (with equal weights) is defined as the 
Borel probability measure v on M which minimizes 

in 

v^J2 W i(^>")> ( n ) 

i=l 

where W^ifii, v) denotes the square of the quadratic Wasserstein distance from 
\ii to v. The relevance of the barycenter to the multi-marginal optimal mapping 
problem (|M[) was observed and investigated by Carlier and Ekeland [5] and 
by Agueh and Carlier pQ. Existence of the barycenter follows easily from a 
continuity compactness argument (see [5], Theorem 3). When fi\ is absolutely 
continuous, uniqueness of the barycenter was shown in [1] for the Euclidean 
case M C K™ , and can be easily generalized to the Riemannian case either from 
Theorem 3.4.1 in [22], or alternatively, by adapting Proposition 4 in [5], using 
McCann's theorem [TH] in place of the twist, or generalized Spence-Mirrlees, 
condition. 

Letting 7 denote the optimal measure in the multi- marginal problem (|K]h 
and y(xi,X2, x m ) the minimizer of y h-> YH=i d 2 {xi,y) (which, by Lemma l3~3l 
is unique for 7 almost all (xi, x 2 , x m )), a result of Carlier and Ekeland [5] 
implies that 



is the unique barycenteiQ- They also showed that, for each i, the measure 7, 



x Note that the minimizer of y t-> d 2 (xi, y) is not necessarily unique for all 

(asi, X2, Xm) here, as was assumed by Carlier and Ekeland in [5]. However, it is clear by 
examining their proof that it is sufficient to have uniqueness for 7 almost all (xi,X2, ...,x m ). 
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defined on M x M by 



7» = (7T;,y)#7 

is optimal for the two marginal Monge-Katorovich problem 

inf f d 2 {x i ,y)d li (12) 

7i JMxM 

with marginals fj,i and v. Here 7^(2:1, x%, x m ) = Xj is the canonical projection. 

The following proposition further highlighs the relationship between the 
barycenter and the optimal maps in Theorem ll.il 

Proposition 5.1. Use the assumption and notation as in Theorem \ 1 . 1[ Then, 
the mappings Fi(x\) in Theorem \1.1\ are of the form Gi o G^ 1 , where G( l is 
the optimal map (for the quadratic cost d 2 (xi , y) ) pushing [i\ forward to the 
barycenter v , and Gi is the optimal map pushing v forward to the measure Hi . 

Remark 5.2. This generalizes the result of Gangbo and Swiech who showed 
that when M C M", the optimal maps for (|M[) take the form Vu* oVui, for con- 
vex functions m and their conjugates u* . Agueh and Carlier later realized that 
the maps Du* were in fact the Brenier (optimal) maps pushing the barycenter 
forward to fXi, and Du\ the Brenier map pushing fii forward to the barycen- 
ter 

Proof. Letting (ui, ita, ...,u m ) be a c-conjugate solution to the dual problem 
JD|), we have 



Ui(xi) = inf 



inf 



c(xi,X 2 , ...,X m ) - y^Mj(Zj) 



y £1 2 



= inf 

y 



— (x j ,y)-u j (x j ) 



(13) 
(14) 
(15) 



This implies that m is ^--concave. Setting Vi(y) = — JZj-^infxj ^( x jiV) 
Uj(xj), we have 

d 2 



Ui{xi) + Vi{y) < —(xi,y) 



(16) 



and we have equality when 



x G 5 := {a; = (xi, ...,x m ) : c(xi,x 2 , ...,a; TO ) - ^ Mf(%)}, 
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that is, (ui,Vi) solve the Kantorovich dual problem to (|12p . 

Now, from the proof of Theorem ll.il we have Fi(x\) = Gj(exp a . 1 (Vui(xi)) 
(for a.e x\ thus, for /xi-a.e x\ by our assumption that fXi is absolutely continuous 
with respect to local coordinates). Also, from (fT5|) . u% is ^--concave, therefore 
by McCann's theorem the map exp^ (Viti(xi)) is optimal. As for /ii-a.e. x± 
we have equality in (|16p only when y = y(x±, ^(xi), F m {xi)), it is clear that 
exp a . 1 (Vui(xi)) # /ii = y # 7 = !/. 

It remains to show that the maps Gi are optimal maps pushing v to fM- 
Let u£(i/) := inf^gM (x^, y) — Wj(xj) be the 4j- concave conjugate of Ui. Note 
that we have Vi(y) < and we have equality whenever there is some Xi 

achieving equality in (|16p , which happens whenever y = y(x) for some x E S. 

Standard arguments now imply that Gi(y) = exp y (Vu^(y)) wherever u\ is 
differentiable; we must show that this is z/-almost everywhere. Although the 
semi-concave function u\ is differentiable almost everywhere with respect to 
local coordinates, the conclusion is not completely obvious, as we do not know 
that the measure v does not charge small sets. (This latter fact is known for 
the Euclidean case, or cquivalently for the Gangbo-Swi§ch cost Y^ijtj \ x i~ x j\ 2 i 
as was shown by Agueh and Carlier [I].) This difficulty can easily be overcome 
thanks to Lemma T3. 5 1 Details follow: 

Fix y £ y(S). (Notice that for i/-almost everywhere points y, we have y G 
y{S).) Note that the c-concave function u\ is supcrdiffcrentiable everywhere 
(since the relevant domains, i.e. support of measures, etc, are compact), and 
that at any point xi where 

d 2 

Ui(xi) + ul(y) = —(xi,y), 

the vector V y 4>(xi,y) is in the superdifferential du1{y). Also, as is well-known 

for concave and for c-concave functions, we see that at any a;,, if V y 4j- (2^ , y) is 
an extremal point of the convex set du^(y), we must have 

d 2 

Ui(xi) + u?{y) = —(xi,y). 

Furthermore, as for y £ y{S) we have v(y) = u?(y), we have equality in (|16[) for 
each such Xj. 

In sum, for each y £ y(S) we have shown that for each Xi, if Vj,4-(xj,j/) is 
extremal in duf(y), then we have equality in (|16[) . But by Lemma 13.51 there is 
exactly one point x, € M for which equality holds in P^|) . It follows that, at each 
point y £ y(S'), du1{y) has only one extremal point, hence is differentiable 
on y(S). Since y £ y(S) for j/-a.c. points y, it now follows that 

Gi(y) = exp y (V<(y)) 

for v-a.e. y and so that 

Fi(xx) = cxp cxPxi Vui ( xl )(Vul(exp xl Vwi(xi))) 
for fii-a.e. x\ as desired. □ 
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6 Extension to other functions of the distance 



In this section we consider the extension of our main theorem to non-quadratic 
cost functions. Precisely, given C 2 , strictly increasing, strictly convex functions 
fi : [0, oo ) — > R, for i = 1, 2, m, we consider cost functions of the form 

m 

c(xi,x 2 , —,x m ) = hif y~]fi(d(xi,y)). (17) 

Note that the previous part of the paper deals with the case fi (t) = t 2 . 

Theorem 6.1. Assume fix is absolutely continuous with respect to local coor- 
dinates. Then the solution 7 to (|Kj) with cost function (JTTJ) is concentrated on 
the graph of a function (i^-Fh, ...,F m ) over the first variable. This function is 
a solution to Monge's problem (|M[) . and the solutions to both (TK| and (|M[) are 
unique. 

Proof. The proof is very similar to the proof of Theorem 1 1.1[ and we only sketch 
it here, explaining how to deal with the main differences. The main difference 
is to verify that a minimizer y € argmin^"^ fi(d(xi,y)) is not in the cutlocus 
of any Xi (an analogue of Lemma 13. 1|) . This follows from a Taylor expansion. 
Details arc given below: 

Note that, by Taylor's theorem, for some p £ [d(xi, exp y (u)), d(xi, y)] 



fi(d(xi,exp y (u))) - fi(d(xi,y)) (18) 
f-(d(x t ,y))(d(x l ,exp y (u)) - d(x l ,y)) 

(d(xi,exp y (u)) - d{xi,y)) 2 



/roo- 



fi{d(xi,y)) 



d(xi,exp y {u)) + d(xi,y) 



(d 2 (xi,exp y (u)) - d 2 {xi,y)) 



,„, Sd{x u exp y {u)) - djx^y)) 2 
' Ji \P) 2 ' ' 

Similarly, for some q € [d(xi,exp y (—u)),d(xi,y)} 

fi(d(xi,exp y (-u))) - fi{d(xi,y)) (20) 

-(d^expyi-uV-d^y)) 



d(xi,exp y (-u)) + d(xi,y) 

, (d(xi,exp (-u))-d(xi,y)) 



2 



+ /fOO - ^ V " f (21) 

By monotonicity of fi, we have fl(d(xi,y)) > and by convexity we have 
fi(p),fi(q) > 0. Also, we have (e.g. see [18]), 

(d{xi,exp y (-u)) - d(x i: y)f < C\u\ 2 + o(\u\). 
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Note that near the cutlocus, the factor d(xi, exp y (u)) + d(xi, y) is bounded below 
by /, the injectivity radius of the compact manifold M (away from the cutlocus, 
inequality (|2"2j) below follows easily from smoothness). Using Lemma 3.12 in [5], 
and combining (|T9")) and (f2"Tj) . we obtain 

/i(d(xi,exp y (u))) + fi(d(xi,exp y (-u))) - 2f l {d(x l ,y)) < K\u\ 2 + o(\u\) 2 (22) 

for some K > 0. Now, if y £ cut(xj), we can, by Proposition 2.5 in [8], for any 
A > 0, choose a u such that 

d 2 (xj,cxp y u) + d 2 (x J ,cxp y (-u)) - 2d 2 (xi,y) < -A\u\ 2 . (23) 

Now, the factor d(xi, exp y (u)) + d(xi, y) is bounded above by 2R, where R is 
the radius of the compact manifold. Therefore, for any B > 0, combining (| 
and (f2"Tj) and ([2^)1 . we can find arbitrarily small u such that 

fi(d( Xj ,exp y (u))) + fi(d( Xj ,exp y (-u))) -2fi(d( Xj ,y)) < -B\u\ 2 + o(\u\) 2 . 

Now, choosing B large enough, and arguing as in Lemma |3.1[ we obtain either 

m m 

^2fi(d(xi,exp y (u))) < ^2fi(d{x l ,y)) 
i=i »=i 

or 

m m 

Y^fi(d(xi,exp y (-u))) < ^2fi{d(x l: y)). 

i=l i=l 

In either case, y ^ argmin^™^ fi(d(xi,y)). The remainder of the proof, 
including analogues of Lemmas 13.31 and 13.51 is very similar to the arguments 
developed earlier in the paper and is omitted. □ 
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